Learning a Value Analysis Tool for Agent Evaluation
نویسندگان
چکیده
Evaluating an agent’s performance in a stochastic setting is necessary for agent development, scientific evaluation, and competitions. Traditionally, evaluation is done using Monte Carlo estimation; the magnitude of the stochasticity in the domain or the high cost of sampling, however, can often prevent the approach from resulting in statistically significant conclusions. Recently, an advantage sum technique has been proposed for constructing unbiased, low variance estimates of agent performance. The technique requires an expert to define a value function over states of the system, essentially a guess of the state’s unknown value. In this work, we propose learning this value function from past interactions between agents in some target population. Our learned value functions have two key advantages: they can be applied in domains where no expert value function is available and they can result in tuned evaluation for a specific population of agents (e.g., novice versus advanced agents). We demonstrate these two advantages in the domain of poker. We show that we can reduce variance over state-of-the-art estimators for a specific population of limit poker players as well as construct the first variance reducing estimators for no-limit poker and multi-player limit poker.
منابع مشابه
A Petri-net based modeling tool, for analysis and evaluation of computer systems
Petri net is one of the most popular methods in modeling and evaluation of concurrent and event-based systems. Different tools have been created to support modeling and simulation of different extensions of Petri net in different applications. Each tool supports some extensions and some features. In this work a Petri net based modeling and evaluation tool is presented that not only supports dif...
متن کاملValidation of Lifelong Learning Tool for Public Librarians' Users in High Schools
Purpose: The purpose of this study is to validate the questionnaire on the lifelong learning readiness of high school users of public libraries. Methodology: This research is an applied and descriptive-correlational study. The statistical population consists of the high school users of public libraries from whom 201 students were selected by using the random sampling method. The questionnaire ...
متن کاملComparison of Open Source Learning Management Softwares and Presenting a Native Evaluation Tool
Introduction: Nowadays all educational institutes are trying to use technology in their structure. This effort has been faced with different barriers, including cost, time, and support. Therefore, using open source softwares can partially help us in using technology. In this article, we review main features of several open source learning management softwares, while presenting a tool which incl...
متن کاملNootropic Medicinal Plants; Evaluating Potent Formulation By Novelestic High throughput Pharmacological Screening (HTPS) Method
The principle of this method was to screen the pharmacological activity of six prepared polyphyto formulations by using high throughput screening method for their nootropic action. The study was performed in three stages using one, two and three animals, respectively in a group. Test formulations were given p.o daily at the dose of 50 and 100 mg/kg body weight. The test formulations were compar...
متن کاملAssessing the validity and reliability of the Persian version of the Isakson and et al. Reading Attitude Questionnaire
Purpose: This study Accomplished to evaluate the adequacy of psychometric properties and validate the Isaacson Reading Attitude Questionnaire. Isaacson et al. (2016) designed The Attitude Towards Academic Reading Questionnaire. Method: To validate this tool, apparent and content validity indices were measured. Also, confirmatory factor analysis was utilized to evaluate the validity of the stru...
متن کامل